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Abstract 

This paper presents a multidimensional Depen- 
dency Grammar (DG), which decouples the de- 
pendency tree from word order, such that sur- 



face ordering is not determined by traversing 
the dependency tree. We develop the notion 
of a word order domain structure, which is 
linked but structurally dissimilar to the syn- 
tactic dependency tree. We then discuss the 
implementation of such a DG using constructs 
from a unification-based phrase-structure ap- 
proach, namely Lexical-Functional Grammar 
(LFG). Particular attention is given to the anal- 
ysis of discontinuities in DG in terms of LFG's 
functional uncertainty. 

1 Introduction 

Recently, the concept of valency has gained con- 
siderable attention. Not only do all linguistic 
theories refer to some reformulation of the tra- 
ditional notion of valency (in the form of 8- 
grid, subcategorization list, argument list, or 
extended domain of locality) ; there is a growing 
number of parsers based on binary relations be- 
tween words ( Eisner, 1997| ; Maruyama, 1990). 
Even theories based on phrase structure may 
have processing models based on relations be- 
tween lexical items ( [Rainbow fc Joshi, 1994| ). 

Against the background of this interest in the 
valency concept, and the fact that word order 
is one of the main difference between phrase- 
structure based approaches (henceforth PSG) 
and dependency grammar (DG), this paper will 
propose a word order description for DG and 
describe its implementation. First, we will mo- 
tivate the separation of surface order and depen- 
dency relations within DG, and make a specific 
architectural proposal for their linking. Second, 
we will briefly sketch Lexical-Functional Gram- 
mar (LFG), and then show in detail how one 



might use the formal constructs provided by 
LFG to encode the proposed DG architecture. 
Our position will be that dependency re- 



lations are motivated semantically (Tesniere, 



1959[) , and need not be projective. We argue 



for so-called word order domains, consisting of 
partially ordered sets of words and associated 
with nodes in the dependency tree. These order 
domains constitute a tree defined by set inclu- 
sion, and surface word order is determined by 
traversing this tree. A syntactic analysis there- 
fore consists of two linked, but dissimilar trees. 

The paper thus sheds light on two questions. 
A very early result on the weak generative 
equivalence of context-free grammars and DGs 
suggested that DGs are incapable of d escrib- 
ing surface word order ( Gaifman, 1965 ). This 
result has been criticised to apply only to im- 
poverished DGs which do not properly repre- 
sent formally the expressivity of contemporary 
DG variants ( Neuhaus fc Broker, 1997 ), and our 
use of a context-free backbone with further con- 
straints imposed by dependency relations fur- 
ther supports the view that DG is not a nota- 
tional variant of context-free grammar. The sec- 
ond question addressed is that of efficient pro- 
cessing of discontinuous DGs. By converting a 
native DG grammar into LFG rules, we are able 
to profit from the state of the art in context- 
free parsing technology. A context-free base (or 
skeleton) has often been cited as a prerequisite 
for practical applicability of a natural language 



grammar (Erbach & Uszkoreit, 1990), and we 
here show that a DG can meet this criterion 
with ease. 

bee. | will briefly review approaches to word 
order in DG, and Sec. ||] introduces word order 
domains as our proposal. LFG is briefly intro- 
duced in Sec. ||, and the encoding of DG within 
the LFG framework is the topic of Sec. [f| 



2 Word Order in DG 

A very brief characterization of DG is that 
it recognizes only lexical, not phrasal nodes, 
which are linked by directed, typed, binary rela- 



tions to form a dependency tree (Tesniere, 1959 
Hudson, 1993| ). If these relations are moti- 



vated semantically, such dependency trees can 
be non-projective. Consider the extracted NP 
in " Beans, I know John likes". A projective 
tree would require "Beans" to be connected to 
either "J" or "know" - none of which is concep- 
tually directly related to "Beans" . It is "likes" 
that determines syntactic features of "Beans" 
and which provides a semantic role for it. The 
only connection between "know" and "Beans" 
is that the finite verb allows the extraction of 
"Beans" , thus defining order restrictions for the 
NP. The following overview of DG flavors shows 
that various mechanisms (global rules, general 
graphs, procedural means) are generally em- 
ployed to lift the limitation of projectivity and 
discusses some shortcomings of these proposals. 



Functional Generative Description ( (Sgall| 

let al, 198€| ) assumes a language-independent 
underlying order, which is represented as a pro- 
jective dependency tree. This abstract repre- 
sentation of the sentence is mapped via order- 
ing rules to the concrete surface realization. Re- 
cently, [Kruijff (1997 ) has given a categorial-style 
formulation of these ordering rules. He assumes 
associative categorial operators, permuting the 
arguments to yield the surface ordering. One 
difference to our proposal is that we argue for a 
representational account of word order (based 
on valid structures representing word order), 
eschewing the non-determinism introduced by 
unary categorial operators; the second differ- 
ence is the avoidance of an underlying structure, 
which stratifies the theory and makes incremen- 
tal processing difficult. 



Meaning- Text Theory ([Melc'uk, 1988Q as- 
sumes seven strata of representation. The 
rules mapping from the unordered depen- 
dency trees of surface-syntactic representations 
onto the annotated lexeme sequences of deep- 
morphological representations include global or- 
dering rules which allow discontinuities. These 
rules have not yet been formally specified 
flMelc'uk & Pertsov, 1987[p.l87f) (but see the 
proposal by Rambow &: Joshi (in print )). 
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Word Grammar (WG, [Hudson (1990|) ) is 
based on general graphs instead of trees. 
The ordering of two linked words is specified 
together with their dependency relation, as 
in the proposition "object of verb follows 
it". Extraction of, e.g., objects is analyzed 
by establishing an additional dependency called 
visitor between the verb and the extractee, 
which requires the reverse order, as in "visitor 
of verb precedes it". Resulting inconsis- 
tencies, e.g. in case of an extracted object, are 
not resolved. This approach compromises the 
semantic motivation of dependencies by adding 
purely order-induced dependencies. 

Dependency Unification Grammar 

(DUG, [Hellwig (19861) ) defines a tree-like data 
structure for the representation of syntactic 
analyses. Using morphosyntactic features with 
special interpretations, a word defines abstract 
positions into which modifiers are mapped. 
Partial orderings and even discontinuities can 
thus be described by allowing a modifier to 
occupy a position defined by some transitive 
head. The approach requires that the parser 
interprets several features in a special way, and 
it cannot restrict the scope of discontinuities. 

Slot Grammar ( [McCord, 1990) ) employs a 
number of rule types, some of which are ex- 
clusively concerned with precedence. So-called 
head/slot and slot/slot ordering rules describe 
the precedence in projective trees, referring to 
arbitrary predicates over head and modifiers. 
Extractions (i.e., discontinuities) are merely 
handled by a mechanism built into the parser. 

3 Word Order Domains 

Extending the previous discussion, we require 
the following of a word order description for DG: 

• not to compromise the semantic motivation 
of dependencies, 

• to be able to restrict discontinuities to cer- 
tain constructions and delimit their scope, 

• to be lexicalized without requiring lexical 
ambiguities for the representation of order- 
ing alternatives, 

• to be declarative (i.e., independent of an 
analysis procedure), and 

to be formally precise and consistent. 
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Figure 1: Dependency Tree and Order Domains 
for [1] 
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Figure 2: Order Domain Structure for 



The subsequent definition of an order domain 
structure and its linking to the dependency tree 
satisify these requirements. 

3.1 The Order Domain Structure 

A word order domain is a set of words, general- 
izing the notion of positions in DUG. The car- 
dinality of an order domain may be restricted 
to at most one element, at least one element, 
or - by conjunction - to exactly one element. 
Each word is associated with a sequence of order 
domains, one of which must contain the word 
itself, and each of these domains may require 
that its elements have certain features. Order 
domains can be partially ordered based on set 
inclusion: If an order domain d contains word 
w (which is not associated with d), every word 
w' contained in a domain d' associated with w 
is also contained in d; therefore, d' C d for each 
d' associated with w. This partial ordering in- 
duces a tree on order domains, which we call 
the order domain structure. The order domain 
structure constitutes a projective tree over the 
input, where order domains loosely correspond 
to partial phrases. 



(1) Den Mann hat der Junge gesehen. 
the man^cc has the boy nom seen 
'The boy has seen the man.' 

Take the German example |(T)| . Its dependency 
tree is shown in Fig. HI, with word order do- 



mains indicated by dashed circles. The finite 
verb, "hat", defines a sequence of domains, 
{di,d2,ds}, which roughly correspond to the 
topological fields in the German main clause. 
The nouns and the participle each define a sin- 
gle order domain. Set inclusion gives rise to the 
domain structure in Fig. 0, where the individ- 
ual words are attached by dashed lines to their 
including domains. 

3.2 Surface Ordering 

How is the surface order derived from an or- 
der domain structure? First of all, the ordering 
of domains is inherited by their respective ele- 
ments, i.e., "Mann" precedes (any element of) 
c?2, "hat" follows (any element of) d\, etc. 

Ordering within a domain, e.g., of "hat" and 
de, or d$ and d§, is based on precedence pred- 
icates (adapting the precedence predicates of 
WG). There are two different types, one or- 
dering a word with respect to any other ele- 
ment of the domain it is associated with (e.g., 
"hat" with respect to d§), and another or- 
dering two modifiers, referring to the depen- 
dency relations they occupy (d$ and d%, refer- 
ring to subj and vpart). A verb like "hat" in- 
troduces three precedence predicates, requiring 
other words (within the same domain) to follow 
itself and the participle to follow subject and 
object, resp.:[] 

"hat" => <* 

A subj < vpart 
A obj < vpart 

Informally, the first conjunct is satisfied by 
any domain in which no word precedes "hat", 
and the second and third conjuncts are satisfied 
by any domain in which no subject or object 
follows a participle (vpart). The obj must be 
mentioned for "hat", although "hat" does not 
directly govern objects, because objects may be 
placed by "hat", and not their immediate gov- 
ernors. The domain structure in Fig.^ satisfies 
these restrictions since nothing follows the par- 
ticiple, and because "den Mann" is not an ele- 
ment of di, which contains "hat" . This is an im- 
portant interaction of order domains and prece- 
dence predicates: Order domains define scopes 
for precedence predicates. In this way, we take 
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1 For more details on the ex act syntax and the seman- 
tics of these propositions, see flBroker, 1998b| ). 



into account that dependency trees are natter 
than PS-based ones^J and avoid the formal in- 
consistencies noted above for WG. 

3.3 Linking Domain Structure and 
Dependency Tree 

Order domains easily extend to discontinuous 
dependencies. Consider the non-projective tree 
in Fig.||. Assuming that the finite verb gov- 
erns the participle, no projective dependency 
between the object "den Mann" and the par- 
ticiple "gesehen" can be established. We al- 
low non-projectivity by loosening the linking be- 
tween dependency tree and domain structure: 
A modifier (e.g., 11 Mann") may not only be in- 
serted into a domain associated with its direct 
head ("gesehen"), but also into a domain of a 
transitive head ("hat"), which we will call the 
positional head. 

The possibility of inserting a word into a do- 
main of some transitive head raises the ques- 
tions of how to require continuity (as needed 
in most cases), and how to limit the distance 
between the governor and the modifier. Both 
questions will be solved with reference to the 
dependency relation. From a descriptive view- 
point, the syntactic construction is often cited 
to determine the possibility and scope of discon- 
tinuities ( |Bhatt, 1990| ; |Matthews, 198l|) . In PS- 
based accounts, the construction is represented 
by phrasal categories, and extraction is lim- 
ited by bounding nodes (e.g., Haegeman 

71993 ), 

Becker et al. ( 199~I| )). In dependency-based ac- 



counts, the construction is represented by the 
dependency relation, which is typed or labelled 
to indicate constructional distinctions which are 
configurationally defined in PSG. Given this 
correspondence, it is natural to employ depen- 
dencies in the description of discontinuities as 
follows: For each modifier, a set of dependency 
types is defined which may link the direct head 
and the positional head of the modifier ("gese- 
hen" and "hat", respectively). If this set is 
empty, both heads are identical and a contin- 
uous attachment results. The impossibility of 
extraction from, e.g., a finite verb phrase fol- 
lows from the fact that the dependency embed- 
ding finite verbs, propo, may not appear on any 



2 Note that each phrasal level in PS-based trees de- 
fines a scope for linear precedence rules, which only apply 
to sister nodes. 



path between a direct and a positional head. 

4 A Brief Review of LFG 

This section introduces key concepts of LFG 
which are of interest in Sec. || and is necessarily 
very short. Further information can be found in 
Bresnan fc Kaplan (1982] ) and Dalrymple et al 



(1995|) . 



LFG posits several different representation 
levels, called projections. Within a projection, 
a certain type of linguistic knowledge is repre- 
sented, which explains differences in the formal 
setup (data types and operations) of the projec- 
tions. The two standard projections, and those 
used here, are the constituent (c-) structure and 
the functional (f-) structure ( [Kaplan (1995| ) and 
Halvorsen Sz Kaplan (1995[ ) discuss the projec- 
tion idea in more detail). C-structure is defined 
in terms of context-free phrase structure rules, 
and thus forms a projective tree of categories 
over the input. It is assumed to encode lan- 
guage particularities with respect to the set of 
categories and the possible orderings. The f- 
structure is constructed from additional annota- 
tions attached to the phrase structure rules, and 
has the form of an attribute- value matrix or fea- 
ture structure. It is assumed to represent more 
or less language-independent information about 
grammatical functions and predicate-argument 
structure. In addition to the usual unification 
operation, LFG employs existential and nega- 
tive constraints on features, which allow the for- 
mulation of constraints about the existence of 
features without specifying the associated value. 

Consider the following rules, which are used 
for illustration only and do not constitute a 
canonical LFG analysis. 



S => NP 

(tOW)=i 
(|CASE)=acc 



NP 



VP 



Det 
(TspecHI 

v 

T=I 
(|tense) 



VP 

T=l 

N 

NP V 

(|SUBJ) = | (|VCOMP) = | 

(|CASE)=nom ~(|tense) 



Assuming reasonable lexical insertion rules, 
the context-free part of these rules assigns the 
c-structure to the left of Fig. || to example 
1). The annotations are associated with right- 
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hand side elements of the rules and define the 




CASE acc 
SPECIE J 

CASE nom 

sPEc^r n 



TENSE L J 

VCOMP L H 

■ ■ ^ 

Den Mann hat der Junge gesehen 

Figure 3: C-structure (left) and f-structure (right) for (T) 



f-structure of the sentence, which is displayed 
to the right of Fig. [|. Each c-structure node 
is associated with an f-structure node as shown 
by the arrows. The f-structure node associated 
with the left-hand side of a rule may be accessed 
with the | metavariable, while the f-structure 
node of a right-hand side element may be ac- 
cessed with the I metavariable. The mapping 
from c-structure nodes to f-structure nodes is 
not one-to-one, however, since the feature struc- 
tures of two distinct c-structure nodes may be 
identified (via the f=J, annotation), and ad- 
ditional embedded features may be introduced 
(such as CASE). Assuming that only finite verbs 
carry the TENSE feature, the existential con- 
straint (J.TENSE) requires a finite verb at the 
beginning of the VP, while the negative con- 
straint ~(J,TENSE) forbids finite verbs at the 
end of the VP. Note that unspecified feature 
structures are displayed as [ ] in the figure, and 
that much more information (esp. predicate- 
argument information) will come from the lexi- 
cal entries. 

Another important construct of LFG is func- 
tional uncertainty ( Kaplan &: Zaenen , 1995 



Kaplan fc Maxwell, 199*5] )^ Very often (most 
notably, in extraction or control constructions) 
the path of f-structure attributes to write down 
is indeterminate. In this case, one may write 
down a description of this path (using a regu- 
lar language over attribute names) and let the 
parser check every path described (possibly re- 
sulting in ambiguities warranted by f-structure 
differences only). Our little grammar may be 
extended to take advantage of functional un- 
certainty in two ways. First, if you want to 
permute subject and object (as is possible in 
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German), you might change the S rule to the 
following: 

S => NP VP 

(T{OBJ I SUBJ}) = | T=l 

The f-structure node of the initial NP may 
now be inserted in either the OBJ or the SUBJ 
attribute of the sentence's f-structure, which is 
expressed by the disjunction {OBJ|SUBJ} in 
the annotation. (Of course, you have to restrict 
the CASE feature suitably, which can be done 
in the verb's subcategorization.) The other reg- 
ular notation which we will use is the Kleene 
star. Assume a different f-structure analysis, 
where the object of infinite verbs is embedded 
under VCOMP. The S rule from above would 
have to be changed to the following: 

S => NP VP 

(T{(VGOMP) OBJ I SUBJ}) = | t=4 

But this rule will only analyse verb groups 
with zero or one auxiliary, because the VCOMP 
attribute is optional in the path description. 
Examples like Den Mann will der Junge gese- 
hen haben with several auxiliaries are not cov- 
ered, because the main verb is embedded under 
(VCOMP VCOMP). The natural solution is to 
use the Kleene star as follows, which allows zero 
or more occurrences of the attribute VCOMP. 

S => NP VP 

(T{VC0MP* OBJ | SUBJ}) = | \=l 

A property which is important for our use of 
functional uncertainty is already evident from 
these examples: Functional uncertainty is non- 
constructive, i.e., the attribute paths derived 



from such an annotation are not constructed 
anew (which in case of the Kleene star would 
lead to infinitely many solutions), but must al- 
ready exist in the f-structure. 

5 Encoding DG in LFG 

5.1 The Implementation Plattform 

The plattform used is the Xerox Lin- 
guistic Environment (XLE, see also 



VALENCY (_o _d _c) = { _o = opt 

(3) ~ (T " d) 

W | (T-d CLASS) = _c 

(f-d LEXEME) }. 

(4) @ (VALENCY req OBJ N) . 
5.2 Topological fields 

As we have seen in Sec. [3|, the order domain 
jructure is a projective tree over the input. So 
it is natural to encode the domain structure in 
context-free rules, resulting in a tree as shown 
in Fig. |j. Categories which have a status as or- 
der domains are named dom*, to be distinguish- 
able from preterminal categories (such as Vf in, 
I, . . . ; these cannot be converted to metacate- 
gories). As notational convention, domC will be 
the name of the (meta) category defining the or- 
der domain sequence for a word of class C. Elim- 
inating the preterminal categories yields exactly 
the domain structure given in Fig. ^. 

A complete algorithmic description of how to 
derive phrase-structure rules from order domain 
definitions would require a lenghty introduction 
to more of XLE's c-structure constructs, and 
therefore we illustrate the conversion with hand- 
coded rules. For example, a noun introduces 
one order domain without cardinality restric- 
tions. Assuming a metacategory DOMAIN stand- 
ing for an arbitrary domain, we define the fol- 
lowing rules for the domain sequences of nouns, 
full stops, and determiners: 



http : //www. pare . xerox . com/istl/groups/nltt/xle^ t.ructure 



which implements a large part of LFG theory 
plus a number of abbreviatory devices. It 
includes a parser, a generator, support for 
two-level morphology and different types of 
lexica as well as a user-friendly graphical 
interface with the ability to browse through 
the set of analyses, to work in batch mode for 
testing purposes, etc. 

We will be using two abbreviatory devices be- 
low, which are shortly introduced here. Both do 
not show up in the final output, rather they al- 
low the grammar writer to state various general- 
izations more succintly. The first is the so-called 
metacategory, which allows several c-structure 
categories to be merged into one. So if we 
are writing |(2)| , we introduce a metacategory 
domVfin (representing the domain sequence of 
finite verbs) to be used in other rules, but we 
will never see such a category in the c-structure. 
Rather, the expansion of the metacategory is di- 
rectly attached to the mother node of the meta- 
category (cf. Fig. 0). 



(2) domVfin = domlNITIAL domMIDDLE domFINAL. 

The second abbreviatory construct is the tem- 
plate, which groups several functional annota- 
tions under one heading, possibly with some 
parameters. A very important template is the 
VALENCY template defined in (3), which defines 
a dependency relation on f-structure (see be- 
low for discussion). We require three parame- 
ters (each introduced by underscore), the first 
of which indicates optionality (opt vs. req val- 
ues), the second gives the name of the depen- 
dency relation, and the third the word class re- 
quired of the modifier. (4) shows a usage of a 
template, which begins with an @ (at) sign and 
lists the template name with any parameters 
enclosed in parentheses. 



(5) domN =>■ DOMAIN* N DOMAIN*, 
doml =^ DOMAIN I. 
domD => D. 



A complex example is the finite verb, which 
introduces three domains, each with different 
cardinality restrictions. This is encoded in the 
following rules: 



(6) 



domVfin = domlNITIAL domMIDDLE domFINAL. 
domlNITIAL => DOMAIN. 
domMIDDLE DOMAIN* Vfin DOMAIN*. 
domFINAL => ( DOMAIN ) . 
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Note the use of a metacategory here, which 
does not appear in the c-structure output (as 
seen in Fig. ||), but still allows you to refer to 
all elements placed by a finite verb in one word. 



ROOT 


220 




doml 


213 



OS 1: 



domlNITIAL : 159 domMIDDLE : 189 dojflFINAL : 195 1:117 

donLN:149 Vfin:42 doinN:175 domVpp:188 . :118 

doi(LD:145 N:28 hat:43 domD:173 N:74 Vpp:108 



D:l Mann:29 
den: 2 



D:52 Junge : 75 gesehen:109 
der:51 



Figure 4: C-structure for (T) 



The definition of DOMAIN is trivial: It is just a 
metacategory expandable to every domain:^ 



(7) DOMAIN = { domVfin 
domD }. 



doml I domN 



5.3 Valencies and Dependency 
Relations 

The dependency tree is, at least in our ap- 
proach, an unordered tree with labelled rela- 
tions between nodes representing words. This 
is very similar to the formal properties of the f- 
structure, which we will therefore use to encode 
it. We have already presented the VALENCY tem- 
plate in |3j and will now explain it. {•••!■■■} 
represents a disjunction of possibilities, and the 
parameter _o (for optionality) controls their se- 
lection. In case we provide the opt value, there 
is an option to forbid the existence of the de- 
pendency, expressed by the negative constraint 
~(|_d). Regardless of the value of _o, there is 
another option to introduce an attribute named 
_d (for dependency) which contains a CLASS at- 
tribute with a value specified by the third pa- 
rameter, _c. The existential constraint for the 
LEXEME attribute requires that some other word 
(which specifies a LEXEME) is unified into the 
feature _d, thereby filling this valency slot. The 
use of a defining constraint for the CLASS at- 
tribute constructs the feature, allowing non- 



3 A number of efficiency optimizations can be di- 
rectly compiled into these c-structure rules. Mentioning 
DOMAIN is much too permissive in most cases (e.g., within 
the NP), and can be optimized to allow only domains in- 
troduced by words which may actually be modifiers at 
this point. 



constructive functional uncertainty to fill in the 
modifier (as explained below). 



A typical lexical entry is shown in (8), where 
the surface form is followed by the c-structure 
category and some template invocations. These 
expand to annotations defining the CLASS and 
LEXEME features, and use the VALENCY template 
to define the valency frame. 



hat Vfin @(Vfin aux-perf ect_) 
(8) @ (VALENCY req SUB J N) 

<§ (VALENCY req VP ART Vpp) . 

5.4 Continuous and Discontinuous 
Attachment 

So far we get only a c-structure where words 
are associated with f-structures containing va- 
lency frames. To get the f-structure shown in 
Fig. |H (numbers refer to c-structure node num- 
bers of Fig. we need to establish dependency 
relations, i.e., need to put the f-structures asso- 
ciated with preterminal nodes together into one 
large f-structure. Establishing dependency re- 
lations between the words relies heavily on the 
mechanism of functional uncertainty. First, we 
must identify on f-structure the head of each 
order domain sequence. For this, we annotate 
in every c-structure rule the category of the 
head word with the template @(HEAD), which 
identifies the head word's f-structure with the 
order domain's f-structure (cf. |(9)[ ). Second, 
all other c-structure categories (which represent 
modifiers) are annotated with the ©(MODIFIER) 

m 



template defined in 



This template states 
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that the f-structure of the modifier (referenced 
by |) may be placed under some dependency at- 
tribute path of the f-structure of the head (ref- 
erenced by |). These paths are of the form p d, 
where p is a (possibly empty) regular expression 
over dependency attributes, and d is a depen- 
dency attribute, d names the dependency rela- 
tion the modifier finally fills, while p describes 
the path of dependencies which may separate 
the positional from the direct head of the mod- 
ifier. The MODIFIER template thus completely 
describes the legal discontinuities: If p is empty 
for a dependency d, modifiers in dependency d 
are always continuously attached (i.e., in an or- 
der domain defined by their direct head). This 
is the case for the subject (in dependency SUBJ) 
and the determiner (in dependency SPEC 



m 



ORDER 



RELFLAG [] 

REL -. FIELD middle 



SUE J 



ORDER [REL -, FIELD middle] 
SPEC 51 [LEXEME definite, CLASS D, CASE nom] 
74 [LEXEME Junge_, CLASS N, CASE nom 

PROPO ORDER [REL -, FIELD middle] 

[FIELD initial 
PEL [42-ORDER-RELFLAG] 

spec l [Lexeme definite class d, case ace] 

28 ^EXEME Mann_, CLASS N, CASE ace 
108|LEXEME sehen_, CLASS Vpp 
42|leXEME aux-perfect_, CLASS Vfin 
117|lEXEME aussage_, CLASS I 



VP ART 



ORDER 



OBJ 



Figure 5: F-structure for (T) 



this example. On the other hand, a non-empty 
path p allows the modifier to 'float up' the de- 
pendency tree to any transitive head reachable 
via p. In our example, objects depending on 
participles may thus float into domains of the fi- 
nite verb (across VP ART dependencies), and rel- 
ative clauses (in dependency RELA) may float 
from the noun's domain into the finite verb's 
domains. 



(9) HEAD = |=f. 



MODIFIER = |=(t{PR0P0 
ISUBJ 

IVPART* OBJ 
I VPART 
I SPEC 

|{SUBJ|OBJ|VPART}* RELA}). 



(10) 



The grammar defined so far overgenerates 
in that, e.g., relative clauses may be placed 
into the middle field. To require place- 
ment in specific domains, additional features 
are used, which distinguish topological fields 
(e.g., via QFIELD) = middle annotations on c- 
structure). A relative clause can then be con- 
strained to occur only in the final field by adding 
constraints on these features. This mechanism 
is very similar to describing agreement or gov- 
ernment (e.g., of case or number), which also 
uses standard features not discussed here. With 
these additions, the final rules for finite verbs 
look as follows: 
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domlNITIAL => DOMAIN : @ (MODIFIER) 

(J.FIELD) = initial. 
domMIDDLE Vf in:@(HEAD) 

(J.FIELD) = middle; 
DOMAIN* :@(M0DIFIER) 

(|FIELD) = middle; 
domFINAL ( DOMAIN : @ (MODIFIER) 

(|FIELD) = final ) . 



5.5 Missing Links 

As is to be expected if you use something for 
purposes it was not designed to be used for, 
there are some missing links. The most promi- 
nent one is the lack of binary precedence predi- 
cates over dependency relations. There is, how- 
ever, a close relative, which might be used for 
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implementing precedence predicates. Zaenen Sz 
[Kaplan (1995| ) introduced f-precedence < f into 
LFG, which allows to express on f-structure con- 
straints on the order of the c-structure nodes 
mapping to the current f-structure. So we might 
write the following annotations to order the fi- 
nite verb with respect to its modifiers, or to 
order subject and object. 



(12) (|) </ (T{SUBJ|OBJ|VPART}) . 
(TSUBJ) < f (tOBJ) . 

The problem with f-precedence, however, is 
that is does not respect the scope restrictions 
which we defined for precedence predicates. 
I.e., a topicalized object is not exempt from 
the above constraints, and thus would result 
in parsing failure. To restrict the scope of f- 
precedence to order domains (aka, certain c- 



structure categories) would require an explicit 
encoding of these domains on f-structure. 

6 Conclusion 

We have presented a new approach to word 
order which preserves traditional notions (se- 
mantically motivated dependencies, topological 
fields) while being fully lexicalized and formally 
precise flBroker, 1997|) . Word order domains are 
sets of partially ordered words associated with 
words. A word is contained in an order domain 
of its head, or may float into an order domain 
of a transitive head, resulting in a discontin- 
uous dependency tree while retaining a projec- 
tive order domain structure. Restrictions on the 
floating are expressed in a lexicalized fashion in 
terms of dependency relations. We have also 
shown how the order domains can be used to 
define a context-free backbone for DG, and used 
a grammar development environment for anno- 
tated phrase-structure grammars to encode the 
DG. 

A number of questions immediately arise, 
some of which will hopefully be answered un- 
til the time of the workshop. On the theoreti- 
cal side, this work has argued for a strict sep- 
aration of precedence and categorial informa- 
tion in LFG (or PSG in general, see ( |Broker, 
1998a| )). Can these analyses and insights be 
transferred? On the practical side, can the 
conversion we sketched be used to create effi- 
cient large-scale DGs? Or will the amount of f- 
structural indeterminacy introduced by our use 
of functional uncertainty lead to overly long pro- 
cessing? And, last and most challenging, when 
will the first large treebank with dependency 
annotation be available, and will it be derived 
from XLE's f-structure output? 
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